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' Abstract. We use a randomised embedding method to prove that for all a > any 

sufficiently large oriented graph G with minimum in-degree and out-degree S + (G), S~(G) > 
(3/8 + contains every possible orientation of a Hamilton cycle. This confirms a 

conjecture of Haggkvist and Thomason. 
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O ! 1. Introduction 

An oriented graph is a loop- free simple graph where each edge is given an orientation. A di- 
rected graph (digraph) is an oriented graph where we allow one edge in each direction between 
each pair of vertices, that is, we allow cycles of length 2. The minimum semi-degree 5 (G) 
of an oriented graph G (or a digraph) is the minimum of its minimum outdegree S + (G) and 
its minimum indegree 5~(G). 
£\\ • A fundamental result of Dirac states that a minimum degree of \G\/2 guarantees a Hamil- 

ton cycle in any undirected graph G on at least 3 vertices. Following this result several 
weaker conditions guaranteeing a Hamilton cycle have been found. One of the famous of 
these is Ore's theorem, which states that if d(x) + d(y) > \G\ > 3 for all x ^ y G V(G) with 
xy E(G) then G contains a Hamilton cycle. In some sense the weakest possible condition 
£ — ■ ■ of this type is Chvatal's theoremQ This gives a condition on the (ordered) degree sequence 

of a graph which forces a Hamilton cycle, such that for any (graphic) degree sequence not 
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satisfying Chvatal's conditions there exists a graph with a degree sequence dominated by 
that sequence not containing a Hamilton cycle. 

There is an analogue of Dirac's theorem for digraphs due to Ghouila-Houri [5J which 
^ ■ states that every digraph D with minimum semi-degree at least \D\/2 contains a directed 

Hamilton cycle. As with Dirac's theorem, taking two disjoint cliques of as equal size as 
possible demonstrates that this minimum degree condition can not be improved. 

Thomassen [23J asked the natural question of whether there exists an analogous result 
for oriented graphs, where one expects to be able to obtain a weaker degree condition than 
the bounds needed for digraphs. Haggkvist [7] constructed an example in 1993 showing 
that a minimum semi-degree of (3n — 4)/8 was necessary and conjectured that this was also 
sufficient. With Thomason [9] he showed in 1997 that for any a > every sufficiently large 
oriented graph G with minimum semi-degree at least (5/12 + a)|G| has a directed Hamilton 
cycle. The author, together with Kiihn and Osthus [II], finally confirmed in 2008 that, up 
to a linear error term, 3|G7|/8 is indeed the correct bound. Following this Keevash, Kiihn 
and Osthus improved this to an exact result. 

Theorem 1 (Keevash, Kiihn and Osthus [H]). There exists no such that every oriented 
graph G on n > uq vertices with 5°(G) > (3n — 4)/8 contains a directed Hamilton cycle. 



^Whilst it is widely regarded as such, it should be noted that Chvatal's theorem does not quite imply 
Ore's theorem. 
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Christofides, Keevash, Kiihn and Osthus [3j have also since found an efficient algorithmic 
proof of (a generalisation of) this result. 

Nash- Williams [20] conjectured a digraph analogue of ChvataFs theorem. This has re- 
cently been approximately confirmed by Kiihn, Osthus and Treglown [18]. There also now 
exists a semi-exact degree condition result due to Christofides, Keevash, Kiihn and Osthus 

Efl- 

It is natural to ask whether these bounds only give us directed Hamilton cycles or whether 
they give every possible orientation of a Hamilton cycle. Indeed this question was answered 
for digraphs, asymptotically at least, by Haggkvist and Thomason in 1995. 

Theorem 2 (Haggkvist and Thomason [8j). There exists uq such that every digraph D 
onn > no vertices with minimum semi-degree 5°(D) > n/2 + n 5 / 6 contains every orientation 
of a Hamilton cycle. 

The question was asked originally for oriented graphs by Haggkvist and Thomason [9] who 
proved that for all a > and all sufficiently large oriented graphs G a minimum semi-degree 
of (5/12 + a)|G| suffices to give any orientation of a Hamilton cycle. They conjectured 
that (3/8 + 0)1(71 suffices, the same bound as for the directed Hamilton cycle up to the error 
term a\G\. Whilst not asked explicitly before Haggkvist and Thomason's paper, there is 
some previous work of Thomason and Grant relevant to this area. Grant [6] proved in 1980 
that any digraph D with minimum semi-degree 6°(D) > 2|D|/3 + \f\D\ log \D\ contains an 
anti-directed Hamilton cycle, provided that n is even. (An anti-directed cycle is one in which 
the edge orientations alternate.) Thomason [22] showed in 1986 that every sufficiently large 
tournament contains every possible orientation of a Hamilton cycle (except possibly the 
directed Hamilton cycle if the tournament is not strong). The following theorem confirms 
the conjecture of Haggkvist and Thomason. 

Theorem 3. For every a > there exists an integer hq = no(a) such that every oriented 
graph G on n > hq vertices with minimum semi-degree 6°(G) > (3/8 + a)n contains every 
orientation of a Hamilton cycle. 

1.1. Robust Expansion. The property underlying the proofs of all the recent Hamilton 
cycle results so far stated is robust expansion. This is a notion which was introduced by Kiihn, 
Osthus and Treglown in [18] and has proved to be the correct notion of expansion in a digraph 
when dealing with this kind of question or when using the Diregularity lemma. Informally 
speaking, a digraph G is a robust outexpander if all subsets of V(G) have outneighbourhoods 
larger than themselves unless they are very large or very small and, moreover, this still holds 
after the removal of a small number of edges. 

Having a minimum semi-degree 5°(G) > (3/8 + a)|G| for some a > 0, satisfying an ap- 
proximate Ore-type condition or satisfying an approximate Chvatal condition imply robust 
outexpansion (see Lemma 11 in [18]). Hence an extension of Theorem [3] to robust out- 
expanders would imply an approximate Ore-type result and a Chvatal-type approximate 
result for arbitrary orientations of Hamilton cycles. The author believes it is likely that the 
argument given in this paper could be straight-forwardly extended to prove this. 

1.2. Extremal Example. Haggkvist [7] constructed an example in 1993 giving a graph 
on n = 8k — 1 vertices with minimum semi-degree (3n — 5) /8 containing no Hamilton cycle 
and Keevash, Kiihn and Osthus extended this to all n. This means that Theorem Q] is best 
possible and that Theorem [3] is best possible up to the linear error term. Interestingly, this 
example can be improved upon when considering arbitrary orientations. Hence the additive 
constant in Theorem [1] is not the correct bound when seeking any orientation of a Hamilton 
cycle, and it is an open question as to what the correct additional term should be. 
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Figure 1. The oriented graph constructed in Proposition [J] 

Proposition 4. There are infinitely many oriented graphs G with minimum semi-degree 
exactly (3|G| — 4)/8 which do not contain an anti-directed Hamilton cycle. 

Proof. Let n := 8m + 4 for some integer m £ N. Let G be the oriented graph obtained 
from the disjoint union of two regular tournaments A and C on 2m + 1 vertices and sets B 
and D of 2m + 1 vertices by adding all edges from A to B, all edges from B to C, all edges 
from C to D and all edges from D to A. Finally, between B and D we add edges to obtain 
a bipartite tournament which is as regular as possible, i.e. the indegree and the outdegree of 
every vertex differs by at most 1. So in particular every vertex in B sends at least m edges 
to D. It is easy to check that the minimum semi-degree of G is 3m + 1 = (3n — 4)/8, as 
required. 

Let us try to construct an anti-directed Hamilton cycle in G and let us start in B with an 
edge going forwards. This edge can go either to C or to D. (Starting with an edge oriented 
backwards produces an identical argument and result.) The next edge must go backwards. 
It can go from C to either B or C. It can go from D to either B or C. So after two steps 
we can be in either B or C. Our next edge must go forwards. If we are in B our possible 
locations after the next two steps are B and C as before. From C we can go forwards 
either to C or to D. Both options repeat situations we have already met. In no case do 
we have a means to reach A whilst respecting the orientation of our anti-directed Hamilton 
cycle. Hence the longest anti-directed cycle in G has length at most 3n/4 and we have no 
anti-directed Hamilton cycle as claimed. □ 

1.3. Pancyclicity. Recently the author, together with Kiihn and Osthus, [13] showed that 
the minimum semi-degree condition in Theorem [1] gives not only a Hamilton cycle but a 
cycle of every possible length. It is natural to ask whether this can be extended to give all 
orientations of all cycles of all possible lengths. A simple probabilistic argument implies that 
Theorem [3] gives arbitrary orientations of any cycle of linear length (i.e. for all a > 0, n > 
every sufficiently large oriented graph G with 5°(G) > (3/8+a)|G| contains every orientation 
of any cycle of length at least It remains an open question as to whether the error 

term can be removed. The results on short cycles necessary to prove the exact pancyclicity 



1 



LUKE KELLY 



result in [13] can (with the addition of an error term in the minimum semi-degree condition) 
also be extended to arbitrary orientations of cycles. In particular, the following theorem can 
be obtained. 

Theorem 5. Let a > 0. Then there exists no = no(ct) such that if G is an oriented graph 
on n > no vertices with minimum semi-degree 5°(G) > (3/8 + a)n then G contains a cycle 
of every possible orientation and of every possible length. 

A proof of Kelly, Kiihn and Osthus of this result and extensions of the stronger results 
on some short cycles can be found in [13] . along with a number of related open problems. 

1.4. Overview of the Proof. The proof of Theorem [3] splits into two parts, both relying 
on the expansion properties that our minimum semi-degree condition implies. The cases 
are distinguished by the similarity of the Hamilton cycle C we are trying to embed to the 
standard orientation of a Hamilton cycle. It turns out that the correct measure, at least 
for this problem, of whether a cycle is close to a directed cycle is the number of pairs 
of consecutive edges with different orientations. Given an oriented graph C we call the 
subgraph induced by three vertices x,y,z G V(C) a neutral pair if xy,zy € E(C). Given 
an arbitrarily oriented cycle Coin vertices let n(C) be the number of neutral pairs in C. 
Write C* for the standard orientation of a cycle on n vertices. When there is no ambiguity 
we will merely write C* . 

The essential idea is to split the cycle up into alternating short and long paths and 
use the probabilistic method to find an approximate embedding of the long paths into a 
Hamilton cycle of the reduced graph created by applying a Regularity lemma for digraphs. 
We connect these paths up greedily using the short paths and then adjust the embedding 
to obtain something which, after the Blow-up lemma has been applied, gives us the desired 
orientation of a Hamilton cycle in our graph. 

The case distinction comes in the manner in which we alter our embedding. In Section[7]we 
give the argument for cycles far from C*, where we use the neutral pairs for our adjustments. 
In Section Owe assume that we have few neutral pairs, and thus many long sections of C 
containing no changes in direction, and use these to adjust our embedding. 

The number of exceptional vertices that the Diregularity lemma produces when applied 
directly is too great for the method used here and hence some technical difficulties are 
introduced. So we control the number of exceptional vertices by randomly splitting our 
oriented graph G. In still vague, but slightly more precise terms, the Diregularity lemma 
will for any e > give us a partition with the property of e-regularity. It will also give us a set 
of 'exceptional vertices' which are in some sense badly behaved, but tells us that these make 
up at most an e proportion of our vertices. Our method can only cope with nn <C en such 
vertices. Hence we split the vertices of our given graph G into two sets A and B of roughly 
equal size (satisfying some 'nice' properties). We apply the Regularity lemma to G [.£?], giving 
us at most e\G\ exceptional vertices Vq. We then apply the Diregularity lemma to G[AU Vq] 
only this time not with parameter e but with 77. This gives us at most ij\G\ exceptional 
vertices Vq. We then consider Gb '■= G[{B \ Vq) U Vq], which is e-regular and has no 
exceptional vertices and Ga ■ = G — Gb, which is ^-regular and has <C "|GU| exceptional 
vertices. Hence, at the cost of some technical work and having to stitch everything back 
together we will be able to control the number of exceptional vertices. 

The next section contains much of the notation we use in this paper. In Section [3] we 
introduce the forms of the Diregularity lemma and Blow-up lemma that we need later. In 
Section [6] we prepare the oriented graph G and the cycle C for our approximate embedding 
and in Section [5] prove the main tool needed to do this. Following that in Section [6] we split 
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into our two cases and in Section [7] (G is far from C*) and Section [8] (G is close to G*) we 
prove Theorem [31 

2. Notation 

Given two vertices x and y of a digraph G, we write xy for the edge directed from x to y. 
The order |G| of G is the number of its vertices. We write Nq{x) for the outneighbourhood 
of a vertex x and d + {x) := l-ZV^x)! for its outdegree. Similarly, we write Nq{x) for the 
inneighbourhood of x and d~(x) := |A^^(x)| for its indegree. Given X C V(G) we denote 
|A^(a;) nXj by gT£(x), and define d^ c (x) similarly. We write Nq(x) := Nq(x) U Nq(x) for 
the neighbourhood of x. We use N + (x) etc. whenever this is unambiguous. We write A(G) 
for the maximum of |-/V(x)| over all vertices x E V(G). Given a set A of vertices of G, we 
write Nq(A) for the set of all outneighbours of vertices in A. So Nq(A) is the union of 
Nq(cl) over all a £ A. Nq(A) is defined similarly. The directed subgraph of G induced 
by A is denoted by G[A] and we write e(A) for the number of its edges. G — A denotes the 
digraph obtained from G by deleting A and all edges incident to A. 

Given two vertices x,y of a digraph G, an x-y path is a path with any orientation which 
joins x to y. We call a path with the standard orientation a directed path. Given two 
subsets A and B of vertices of G, an ^4- 1? edge is an edge ah where a G A and b £ B. We 
write e(A, S) for the number of all these edges. A walk in G is a sequence vif 2 • • • vi of (not 
necessarily distinct) vertices, where tw+i or Vi+\Vi is an edge for all 1 < i < I. The length 
of a walk W is := £ — 1. The walk is closed if ui = t^. Given two vertices x, y of G, the 

distance dist(x, y) from x to y is the length of the shortest directed x-y path. The diameter 
of G is the maximum distance between any ordered pair of vertices. 

We write [k] for the set {1,2,..., k}. We write < a\ <C 02 <C ■ ■ • <C cifc to mean that we 
can choose the constants ai, 02 j ■ • • > Ofe from right to left. More precisely, there are increasing 
functions fi, fz, • • • > fk-1 such that, given a^, whenever we choose some < /j(a,, + i), all 
calculations needed using these constants are valid. 

3. The Diregularity lemma and the Blow-up lemma 

In this section we collect all the information we need about the Diregularity lemma and 
the Blow-up lemma. See [16] for a survey on the Regularity lemma and [15] for a survey 
on the Blow-up lemma. We start with some more notation. The density of an undirected 
bipartite graph G = (^4, B) with vertex classes A and B is defined to be 

e G (A,B) 



d G (A,B) 



\A\\B\ 



We often write d(A, B) if this is unambiguous. Given e > 0, we say that G is e-regular if 
for all subsets X C A and Y C B with \X\ > e\A\ and \Y\ > e\B\ we have that \d(X,Y) - 
d(A, B)\ < e. Given d G [0,1] we say that G is (e,d)- super-regular if it is e-regular and 
furthermore d B (a) > [d - e)\B\ for all a G A and cU(&) > [d - e)\A\ for all b £ B. (This 
is a slight variation of the standard definition of (e, (i)-super-regularity where one requires 
d B (a) > d\B\ and d A (b) > d\A\.) 

The Diregularity lemma is a version of the Regularity lemma for digraphs due to Alon 
and Shapira [I]. Its proof is quite similar to the undirected version. We will use the degree 
form of the Diregularity lemma which can be easily derived (see e.g. [23]) from the standard 
version, in exactly the same manner as the undirected degree form. 

Lemma 6 (Degree form of the Diregularity lemma). For every e G (0,1) and every inte- 
ger M' there are integers M and uq such that if G is a digraph on n > no vertices and 
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d £ [0, 1] is any real number, then there is a partition of the vertices of G into Vq,V\, . . . ,Vk 
and a spanning subdigraph G 1 of G such that the following holds: 

• M' <k < M, 

• \V \<en, 

• \V 1 \ = --- = \V k \=:m, ^ 

• d~Q,(x) > d~^{x) — (d + e)n for all vertices x £ G, 

• dQ,(x) > dQ{x) — (d + e)n for all vertices x £ G, 

• for every ordered pair ViVj with 1 < i,j < k and i ^ j the bipartite graph (Vi, 
whose vertex classes are Vi and Vj and whose edge set consists of all the Vi-Vj edges 
in G' is e-regular and has density either or at least d, 

• for all 1 < i < k the digraph G'[Vi] is empty. 

Vi, . . . ,Vk are called clusters, Vq is called the exceptional set and the vertices in Vq are 
called exceptional vertices. Note that in G' all pairs of clusters are e-regular in both directions 
(but possibly with different densities). We call the spanning digraph G' C G given by the 
Diregularity lemma the pure digraph. Given clusters Vi,...,Vk and the pure digraph G' , the 
reduced digraph R' is the digraph whose vertices are V±, . . . , \% and in which ViVj is an edge 
if and only if G' contains a Vi-Vj edge. Note that the latter holds if and only if (Vi, Vj)c' is 
e-regular and has density at least d. It turns out that R' inherits many properties of G, a 
fact that is crucial in our proof. However, R' is not necessarily oriented even if the original 
digraph G is. The following straightforward lemma, taken from a paper of Kelly, Kiihn and 
Osthus [H], shows that by discarding edges with appropriate probabilities one can go over 
to a reduced oriented graph R C R' which still inherits many of the properties of G. 

Lemma 7. For every e £ (0, 1) there exist integers M' = M'(e) and uq = no(e) such that 
the following holds. Let d £ [0, 1], let G be an oriented graph of order at least no and let R' 
be the reduced digraph and G' the pure digraph obtained by applying the Diregularity lemma 
to G with parameters e, d and M' . Then R' has a spanning oriented subgraph R with 

(a) 8+{R) > (6+(G)/\G\ - (3e + d))\R\, 

(b) 5-(R) > (5-(G)/\G\ - (3e + d))\R\, 

(c) 5°(R) > (S°(G)/\G\ - (6e + 4d))\R\. 

The oriented graph R given by Lemma [7] is called the reduced oriented graph. The span- 
ning oriented subgraph G* of the pure digraph G' obtained by deleting all the Vi-Vj edges 
whenever ViVj £ E(R') \ E(R) is called the pure oriented graph. Given an oriented subgraph 
S C R, the oriented subgraph of G* corresponding to S is the oriented subgraph obtained 
from G* by deleting all those vertices that lie in clusters not belonging to S as well as 
deleting all the Vi-Vj edges for all pairs Vi, Vj with ViVj ^ E(S). 

At various stages in our proof we will need some pairs of clusters to be not just regular 
but super-regular. The following well-known result tells us that we can indeed do this whilst 
maintaining the regularity of all other pairs. 

Lemma 8. Let eCd, 1/A and let R be a reduced oriented graph of G as given by Lemmas® 
and [3 Let S be an oriented subgraph of R of maximum degree A. Then we can move 
exactly 2Ae\Vi\ vertices from each cluster into Vo such that each pair (Vi, Vj) corresponding 
to an edge of S becomes (2s, d/ 2) -super-regular and every pair corresponding to an edge 
of R\S becomes 2e-regular with density at least d — e. 

In our proof of Theorem [3] we will also need a consequence of the Blow-up lemma of 
Komlos, Sarkozy and Szemeredi |T7j. Roughly speaking, it says that an r-partite graph 
formed by r clusters such that all the pairs of these clusters are (e, (f)-super-regular behaves 
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like a complete r-partite graph with respect to containing graphs of bounded maximum 
degree as subgraphs. 

Lemma 9 (Blow-up Lemma, Komlos, Sarkozy and Szemeredi [H]). Given a graph F on 
[k] and positive integers d and A there exists a positive real Eq = eo(d,A,k) such that the 
following holds for all positive numbers and all < e < £q. Let F' be the graph 

obtained from F by replacing each vertex i € F with a set Vi of li new vertices and joining all 
vertices in Vi to all vertices in Vj whenever ij is an edge of F. Let G' be a spanning subgraph 
of F' such that for every edge ij € F the graph (Vi,Vj)c' is (e, d) -super-regular. Then G' 
contains a copy of every subgraph H of F' with maximum degree A (if) < A. Moreover, this 
copy of H in Q maps the vertices of H to the same sets Vi as the copy of H in F' , i.e. if 
h € V(H) is mapped to V, by the copy of H in F' , then it is also mapped to Vi by the copy 
of H in G'. 

The tool we shall actually use is the following consequence of the Blow-up lemma. The 
proof of it uses similar ideas to those in recent work of Christofides, Keevash, Kiihn and 
Osthus pg. 

Lemma 10. Suppose that all the following hold: 

• < l/m < e < d < 1. 

• Ui, . . . , Uk are pairwise disjoint sets of size m, for some k > 6, and G is a digraph 
onU\U. . .UUk such that each {Ui, Ui + i)c is (s, d) -super-regular (where by convention 
we consider U^+i to be U\); 

• Ai, . . . , Ak are pairwise disjoint sets of vertices with (1 — e)m < \Ai\ =: rrtj < m 
and H is a digraph on A\ U ... U A^ which is a vertex-disjoint union of paths of 
length at least 3, where every edge going out of Ai ends in Ai + i for all i; 

• Si C Ui, . . . , Sk C Uk are sets of size \Si\ = mn 

• For each path P of H we are given vertices xp,yp £ V{G) such that if the initial 
vertex ap of P belongs to Ai then xp € Si and if the final vertex bp of P belongs 
to Aj then yp £ Sj, and the vertices xp,yp are distinct as P ranges over the paths 
ofH. 

Then there is an embedding of H into Gs '■= G[U Si] ^ n which every path P of H is mapped 
to a path that starts at xp and ends at yp. 

The following immediate consequence of the Blow-up lemma is needed in the proof of 
Lemma [TUl 

Lemma 11. For every < d < 1 and p > 4 there exists £o > such that the following holds 
for < e < sq. Let U\, . . . ,U P be pairwise disjoint sets of size m, for some m, and suppose G 
is a graph on Ui U . . .UU p such that each pair (Ui, Ui + i), 1 < i < p— 1 is (e, d)- super-regular. 
Let f : Ui —s- Up be any bijective map. Then there are m vertex- disjoint paths from U\ to U p 
so that for every x € Ui the path starting at x ends at fix) € U p . 

We also need the following random partitioning property of super-regular pairs which 
says that with high probability (i.e. with probability tending to 1 as m — > oo) all new pairs 
created by a random partition of a super-regular pair are themselves super-regular. 

Lemma 12. Suppose that the following hold. 

• 0<e<9<d< 1/2, k > 2 and for 1 < i < k we have ai, b% > 9 with ^i=i a i = 

Ztih = i. 

• G = (A,B) is an (e,d)- super-regular pair with \A\ = \B\ = m sufficiently large. 



s 



LUKE KELLY 



• A = A\ U . . . U Ak and B = B% U . . . U are partitions chosen uniformly at radnom 
with \AA = aim and \Bi\ = biiri for 1 < i < r. 
Then with high probability (Ai,Bj) is (9~ 1 e,d/ 2) -super-regular for every 1 < i,j < k. □ 

With these tools we can now prove Lemma [TT)1 

Proof, [of Lemma [10] Enumerate the paths of H as Pi, . . . , P p and split them up arbitrarily 
into paths of length 3, 4 or 5 such that Pi becomes Pi i, ■ ■ ■ , Pi,q v Let Oj j and bij be the 
initial vertex and the final vertex of Pj respectively. Then cij = for 2 < j < qi. 

Let E s consist of all belonging to the cluster A s and similarly let F s consist of all bij 
belonging to the cluster A s . For each cij € E s pick a distinct vertex Xjj £ and for 
each 6jj 6 P s pick a distinct vertex yij E such that if Ojj = hj-i then Xjj = yij-i, 
Xi,i = xp tj and yj )mi = VP.j- It is sufficient to show that there is an embedding of H in 
which each path Pjj is mapped to a path in Gs starting at Xij and ending at yij. 

For a path Pjj encode whether each edge in Pjj goes forwards or backwards. If Pj has 
length 3 then, writing f for an edge going from some Ag to Ai+\ and b for an edge going 
from Aj> to Af—i, t encodes one of the following 2 3 = 8 possibilities: 



Similarly there are 2 4 possibilities for paths of length 4 and 2 5 for those of length 5. We 
divide the paths Pij into 56A: subcollections based on the orientations of their edges. It 
transpires that there are notational advantages in doing this by encoding the destination of 
each vertex relative to the first. More precisely, we divide the paths into subcollections Va 
with 1 < % < k, 3 < t < 5 and 



encoding one of the 2 3 + 2 4 + 2 5 = 56 possibilities discussed above and the length £ = £(t) 
of the paths. Note that we always have t(0) = 0. For example, a path oriented f fb would 
have t : (0, 1, 2, 3) i— > (0, 1, 2, 1). V^t contains all paths Pij of length i starting in Ai with 
each vertex in Pjj going to the cluster relative to Ai given by t. 

Observe that as \Ui\Si\ < em, every pair (Si, Si + i) is (2e, <i/2)-super-regular. We first use 
a greedy algorithm to sequentially embed those collections Vij containing at most d 2 m paths. 
That is, we pick any |Pi,t| vertices in Si to be the start of these paths, and then construct 
these paths by selecting any (distinct) neighbours of these vertices in the Sj appropriate for 
each vertex in each path. Each set Si is met by at most 11 x 56 of the collections so at 
any stage in this process we have used at most 6 x 11 x 56d 2 m vertices from any cluster Ui. 
As we have (i C 1 the restriction of any pair (S^, Si+i) to the remaining vertices is still 
(4e, <i/4)-super-regular and so we can indeed do this. 

Having embedded all the V%t containing few paths, we randomly split the remaining 
vertices so that for each large V% t t we have sets Sf t C S i+t ^ =i , S} t C S i+t ^, . . . , Sf t C 
£>i+M) each of size \Vi,t\ > d 2 m. By Lemma [T2l for each large collection V%f, and for all < 



r < £ - 1 the pair (S r i>t , if t(r + 1) > t(r) or the pair (S]j\ S£ t ) if t(r + 1) < t(r) 



is (4<i 2 e, <i/8)-super-regular with high probability. Thus for sufficiently large m we can 
choose a partition with this property and apply Lemma [UJ to embed each large V% t within 



fff ffb fbf fbb bff bfb bbf bbb. 



t:{0,l,...,£}^ {-£,-£ + 1,... 



(} 



its allocated sets. 



□ 



4. Skewed Traverses and Shifted Walks 



In this section we introduce some tools needed to tweak a random embedding of an 
arbitrarily oriented Hamilton cycle into a directed Hamilton cycle of the reduced oriented 
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graph to make it correspond (in some sense) to the desired orientation of a Hamilton cycle 
in our original graph. 

The following crucial result says that our minimum semi-degree condition implies outex- 
pansion. 

Lemma 13 (Kelly, Kiihn, Osthus [14J). Let R be an oriented graph with 6°(R) > (3/8+a)\R\ 
for some a > 0. If X C V(R) with < \X\ < (1 - a)\R\ then \N+(X)\ > \X\+a\R\/2. 

Suppose that F is a Hamilton cycle (with the standard orientation) of the reduced oriented 
graph R and relabel the vertices of R such that F = V1V2 ■ ■ ■ Vm, where we let M := \R\. 
Create a new digraph R* from R by adding all the exceptional vertices v G Vb to V(i?) and 
adding an edge vVi (where Vi is a cluster containing m vertices) whenever |iV^(v)| > cm for 
some given constant c > 0. (Recall that m denotes the size of the clusters.) The edges in R* 
of the form V{V are defined in a similar way. Let G c be the digraph obtained from the pure 
oriented graph G* by making all the non-empty bipartite subgraphs between the clusters 
complete (and orienting all the edges between these clusters in the direction induced by R) 
and adding the vertices in Vb as well as all the edges of G between Vb and V(G — Vb). 

Let W be an assignment of the vertices of an arbitrarily oriented cycle Conn vertices to 
the vertices of R* which respects edges (i.e. is a digraph homomorphism from C to R*). We 
denote by a(i) the number of vertices of C assigned to the cluster V,. Observe that we can 
think of W either as a (possibly degenerate) embedding into G c or as a closed walk in R*. 
It will be useful to the reader to keep this duality in mind when reading the rest of the proof 
We say that an assignment W of C to R* is ^-balanced if maxj \a(i) — m\ < jn and balanced 
if a(i) = m for all 1 < i < M. Furthermore, we say that an assignment (7, //)- corresponds 
to C if the following conditions hold. 

• W is 7-balanced. 

• Each exceptional vertex v £ Vb has exactly one vertex of C assigned to it. 

• In every Vi £ V(R) at least m — fin of the vertices of C assigned to Vi have both of 
their neighbours assigned to Vj_i U Vi + \. 

We say that the assignment fi- corresponds to C if it (0, /^-corresponds to C. 

Once we have found such an assignment we can, with some work, use Lemma [10] to show 
that it corresponds to a copy of C in G. Our immediate aim then is to find such a closed 
walk corresponding to C. 

Given clusters V and V', a skewed V -V traverse T(V,V) is a collection of edges of the 
form 

T(V, V) := VV h , V^Vi,, V i2 _aV i3 , . . . , V it _ x V'. 

The length of a skewed traverse in the number of its edges minus one; so the length of the 
above skewed traverse is t. Suppose that we have a 7-balanced assignment W of C to R* and 
that each vertex of R has many neutral pairs of C assigned to it. We would like to make this 
a balanced embedding by modifying W . Let V^, Vj be clusters with a(i) > m and a(j) < m. 
If Vi-iVj S E(R) then we could replace one neutral pair assigned to Vi-iViVi-i in the 
embedding with Vi-\VjVi-\. This would reduce a{i) by one and increase a(j) by one. 
Repeating this process would give the desired balanced embedding. We can not guarantee 
though that Vi-\Vj £ E(R) so we are forced to use skewed traverses to achieve the same 
effect, which we are able to show always exist under certain conditions. Let 

Vi^V h , V h ^V h , Vi 2 -iV i3 , . . . , V^Vj. 

be a skewed Vi-i-Vj traverse. Then replacing neutral pairs starting at Vj_i, V^-i, • • • , Vj t _i 
with the edges in the skewed V—i-Vj traverse we reduce a(i) by one, increase a(j) by one 
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Figure 2. A skewed V-V traverse 



and crucially do not alter a(k) for any Vk £ V(R) \ {Vi, Vj}. See Figure [2] for an illustration 
of this, where the dashed edges represent the neutral pairs which will be replaced by the 
solid edges representing the edges of the skewed traverse. We always assume that a skewed 
traverse has minimal length and thus that each vertex Vi £ V(R) appears at most once as 
the first vertex of an edge in a skewed traverse. 

Given vertices V, V £ V(R) and a Hamilton cycle F of R, a shifted V-V walk S(V, V) 
is a walk of the form 

S{V,V) := V Vi x FVi x -\ V l2 FV i2 ^ . . . V H FV lt ^ V, 

where we write ViFVj for the path 

ViFVj :=VV l+1 V i+2 ...V j , 

counting indices modulo |F| = k. (The case t = 0, and thus a walk VV' , is allowed.) We 
say that W traverses F t times and always assume that a shifted walk S(V, V') traverses F 
as few times as possible. Its length is the length of the corresponding walk in R. Note that 
if we can find a skewed V-V traverse then we can find a shifted V-V walk. 

The most important property of shifted walks is that the walk W — {V, V} visits every 
vertex in R an equal number of times. Observe also that by our minimality assumption each 
vertex Vi is visited at most one time from a vertex other than Vi-\. I.e. of the t times that Vi 
is visited at most one does not come from winding around F. This fact will be useful later 
when we try and bound the number of edges of an embedding not lying on the edges of F. 

As with skewed traverses, we can use shifted walks to go from an approximate assign- 
ment W of a cycle C to a balanced assignment. Let V,Vj be clusters with a(i) > m 
and a(J) < m. If V-iVj, VjVi+i € E(R) then we could replace one section of W isomorphic 
to F by Vi-iVjVi+iFVi-i, that is, replace Vi-iVV+i by Vi-\VjVi + \. This new section has 
the same length as before and so would not alter the rest of W . Clearly we can not ensure 
that such edges always exist. Instead we use shifted walks and replace a section of the 
embedding that looks like FF . . . F with 

SiVi^V^SiVj, V+^FV^F . . . FVi-r, 

where the F . . . F in the new embedding contains the appropriate number of F to ensure 
that it is of exactly the same length as the section of the assignment it replaced. This is a 
shifted walk from to Vj, then a shifted walk from Vj to Vi + \ and then wind around F. 
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By our definition of shifted walks each cluster will have the same number of vertices assigned 
to it (except Vi-i, Vi and Vj) and the total number of vertices assigned will not be altered. 
Clearly this method needs the cycle we're trying to embed to contain many long sections 
with no changes of orientation (and oriented in the same direction as F) . In the case where 
the cycle we are trying to embed is close to C*, the standard orientation of a cycle, we are 
indeed able to ensure this. 

Corollary 14. Let R be an oriented graph on k vertices with 5°(R) > (3/8 + a)k for some 
a > and let F = V\V% . . . Vk be a directed Hamilton cycle of R. Define r := |~2/a] . Then 
for any distinct V, V' € V(R) there exists the following. 

(i) A skewed V-V traverse of length at most r. 

(ii) A shifted V-V 1 walk traversing at most r cycles. 

Proof. Let Ai be the set of vertices which can be reached from V by a skewed traverse of 
length at most i and let A~ := {V G V(R) : V i+1 G Ai}. If \Ai\ > (1 - a)k then N~(V') n 
A~ ^ and we have a skewed V-V' traverse of length i + 1. If \AA < (1 — a)k then we 
can apply Lemma [T3l (here we also need that N + (V) ^ 0) to get that |A + i| > \Ai \ + ak/2. 
Since \A r ^ 2 \ > (1 - &)k > k - \N~(V')\ we again have N~(V) n A~_ 2 ^ and hence the 
desired skewed traverse. 

This skewed traverse also gives the desired shifted walk, merely 'wind around' F after 
each edge. □ 



When linking together sections of our cycle we will sometimes need to find a path between 
two vertices which is not just short but is isomorphic to a path with given length and 
orientation. To do this we use the following lemma of Haggkvist and Thomason. 

Lemma 15 (Haggkvist and Thomason [9]). Let R be an oriented graph on k vertices 
with 6°(R) > (3/8 + a)k for some a > 0. Let 4[log 2 (l/a)] < k < ak/4 and let P be 
an arbitrarily oriented path of length k. Then, if k is large enough and V,V € V{R) are 
distinct vertices, there exists a path from V to V isomorphic to P. 



5. AN APPROXIMATE EMBEDDING LEMMA 

Our main tool in our proof of Theorem [3] is the following probabilistic result which says 
that we can assign a series of paths Pi to the vertices of a small graph R such that each 
vertex of R is assigned approximately the same number of vertices. Furthermore, we show 
that if we have a collection of subpaths of the Pi we can assure that every vertex of R is 
assigned a reasonable number of the starting points of these. When we talk about 'greedily 
embedding an oriented path Pi around a cycle F given a starting point V € V(Fy we mean 
the following. Assign the first vertex of Pi to V. Given an embedding of some initial segment 
of Pi which ends at V S V(F) assign the next vertex of Pi to either the successor or the 
predecessor of V 1 in F according to the orientation of the edge in Pj. 

Lemma 16. Let R be an oriented graph on k vertices and let F be a Hamilton cycle in R. 
Let V = {Pi, . . . , P s } be a collection of arbitrarily oriented paths on t vertices and Q be a 
collection of pairwise disjoint oriented subpaths of the Pj. Then for any 7 > and suffi- 
ciently large s there exists a map (f> : [s] — * V(R) such that if the paths are greedily embedded 
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around F with the embedding of each P(i) starting at <j>{i) then the following holds. De- 
fine a{i) to be the number of vertices in Uj=i Pj assigned to V by this embedding and de- 
fine n(i,Q) to be the number of oriented subpaths in Q starting at Vi. Then for all V £ V(R) 



(1) 
(2) 



a(.) - j 



n(i,Q)-^ 



< yst, 
< jst. 



To prove it we need the following well-known probabilistic bound (see |19j for example). 

Theorem 17. Let X be a random variable determined by s independent trials Xi, . . . ,X S 
such that changing the outcome of any one trial can affect X by at most c. Then for 
any A > 0, 

Pr(|X-E(X)| > A) <2exp (-^ 

Proof, [of Lemma [16] We construct <f) by picking each <p(i) independently and uniformly at 
random. Observe that the assignment of any one path Pj can change the number of vertices 
assigned to any vertex of R by at most t. Clearly E(a(i)) = st/k. By Theorem [T7] we have 

Pr(|o(i) - st/k\ > jst) < 2exp(-^^) = 2exp(-^) < 1/(2*;) 

for s S> k. 

A similar argument gives that the probability that n(i,Q) differs too much from the 
expected value is at most l/(2k). Thus the probability that there exists Vi which does not 
have almost the expected number of vertices or almost the expected number of starting 
points of paths in Q assigned to it by cj) is less than 1. So with positive probability a map 
constructed in this manner satisfies the conclusion of the lemma, and hence such a map 
exists. □ 



6. Preparations for the Proof of Theorem [3] 

6.1. The Two Cases. We split into two cases depending on the number of neutral pairs. 
Let G be an oriented graph on n vertices with 5°(G) > (3/8 + a)n for some constant < 
a -C 1. Let C be an orientation of a cycle on n vertices with n(C) =: An neutral pairs. 
Define the following hierarchy of constants. 

< e x < e 2 < £3 < £4 < £5 < £6 < a < 1. 

Let Q be a maximal collection of neutral pairs all at a distance of at least 3 from each other. 

If A <C £4 then let e := Eq, ea '■= £5 and £* := £4. The proof of this case is given in 
Section [8l 

Otherwise we have A 3> £3 and we set £ := £3, ea '■= £2 and e* := e%. The proof of this 
case is in Section [71 

The following two sections, where we partition G and C in preparation for our embedding, 
are common to both cases. 
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6.2. Preparing G for the Proof of Theorem [3] Define a positive constant d and inte- 
gers M' A , M' B (all functions of a) such that 

< e* < l/M' A < e A < 1/M'b «£«ti«a«l. 

Chernoff type bounds applied to a random partition of V(G) show the existence of a sub- 
set A C V(G) with (1/2 — e)n < |Aj < (1/2 — e)n such that every vertex x G V(G) satisfies 

d+(x) a |AT+(x)l q 
U n 10 _ \A\ ~ n 10 

and similarly for d~(x). Apply the Diregularity lemma (Lemma|6} with parameters e 2 , d+8e 2 
and M' B to G — A to obtain a partition of the vertex set of G — A into Mg := k > M' B 
clusters V%, . . . , V& and an exceptional set Vq. Set B := V\ U . . . U V& and tub ■= \Vi\ = 
••• = \Vk\- Let G" B := G[-B], let Rb denote the reduced oriented graph obtained by an 
application of Lemma [7] and let G* B be the pure oriented graph. By our choice of A we have 
8 + (G — A)/\G — A\ > 5 + {G)/n — a/9 and a similar bound for S~. Hence we can apply 
Lemma [7] to obtain 

So Theorem [1] gives us a Hamilton cycle Fb of Rb- Relabel the clusters of Rb so that ViVi + \ € 
E{Fb) for all i where we let V^+i := V±. We now apply Lemma[8]with Fb playing the role 
of 5*, e 2 playing the role of e and d + 8s 2 playing the role of d. This shows that by adding at 
most Ae 2 n further vertices to the exceptional set Vq we may assume that each edge of Rb 
corresponds to an e-regular pair of density at least d (in the underlying graph of G* B ) and that 
each edge in Fb corresponds to an (e, (i)-super-regular pair. Note that the new exceptional 
set now satisfies \Vq\ < en. 

Now apply the Diregularity Lemma with parameters £^/4, d + 2e 2 A and M' A to G[A U Vb] 
to obtain a partition of the vertex set of G[AU Vb] into Ma ■= I > M' A clusters V[, ■ ■ ■ , Vj 
and an exceptional set Vq. Let A' := V[ U • • • U V[ , let Ra denote the reduced oriented 
graph obtained from Lemma [7] and let G* A be the pure oriented graph. As before Lemma [JJ 
implies that 5°(R A ) > (3/8 + 3a/4)\R A \ and so, as before, we can apply Theorem [T] to find a 
Hamilton cycle Fa of Ra- Then as before, Lemma [8] implies that by adding at most e^|AuVb| 
further vertices to the exceptional set V ' we may assume that each edge of Ra corresponds 
to an e^-regular pair of density at least d and that each edge in Fa corresponds to an 
(ea, (i)-super-regular pair. Finally define Gb '■= G[B UVq] and := \Gb\ and observe that 
we now have 

(5) \Vl\<e A \AUV Q \/2<e A n B . 
In both cases of our proof we now have 

< e* < I/Ma < £a < 1/M B <e<d<q<l. 

6.3. Preparing C. We would like to divide C into a number of paths and use Lemma [TBI 
to obtain an e-balanced assignment of C to R. Since we have split our graph G into two 
parts, we have to split C into two paths Pa and Pb and embed these into (an oriented graph 
similar to) G[A'] and Gb respectively. 

Define r := 4[~log 2 (4/q)] . Lemma [15] tells us that if P is an orientation of a path of 
length r then between any two distinct vertices in V(R B ) or in V(R A ) there exists a path 
isomorphic to P. 
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Define 



a := L(logn) 2 J, t 



n 



(s + l)(r- 1) 



n 



(log n) 



s + 2 

Recall that Q is a maximal collection of neutral pairs in C all at a distance of at least 3 
from each other. If Q is large, i.e. we are in the case where C is far from C*, let v* be a 
vertex in C such that the subpath of C of length n/2 following v* and the subpath of C 
preceding v* both contain at least 2|Q|/5 elements of Q. Divide C into overlapping paths 
(by which we mean paths sharing endvertices) 

C := Q1P1Q2P2 ■ ■ ■ Qs-iPs-iQsPsQ* P* 

where their lengths satisfy £{Pi) = t, i(Qi) = £(Q*) = r and 2t < 1{P*) < 3t and Q 1 starts 
at v* . Let sb G N be such that 

Kn B -s B (t + r) <£(P*) 

and let 

Pb '■= P B QiPi • ■ ■ Qs B Ps B 
where P B is an end-segment of P* of such length as to ensure (-{Pb) + 1 = n B- Let 

P A := Q'^.-.Q^P's^Pl 

where Q[ := Q SB+ i, P! := P SB +i, sa '■= s — sb and P\ is an initial-segment of P* which 
overlaps P B in exactly one place. Observe that we now have 

(6) n B = s B t + s B r + £(P£) + l = \V(P B )\ 

and define 

n A :=n-n B = s A t + (s A + l)r + £{P* A ) + 1 = \V{P A )\ ~ 2- 
7. Cycle is Far From C* 

7.1. Approximate Embedding. First we use the probabilistic tools in Section [5] to assign 
the paths Pi to the clusters of Rb in such a way as to ensure that all the clusters are 
assigned approximately the same number of vertices and the neutral pairs are relatively 
evenly distributed. Let Qb C Q consist of all neutral pairs from Q which are contained 
in the Pi and moreover are at a distance of at least three from the ends of the Pi. Apply 
Lemma [TBI to Rb, Vb '■= {PuPz ■ ■ ■ ,Ps B } an d Qb with e* as 7 to obtain an embedding of 
the Pi into V(R B ) with 



o(i) 



s B t 



M B 



< e*s B t, 



n(i, Qb) 



\Qb\) 



M b 



< e*s B t. 



for all Vi G V(R B ). (Recall that a(i) is defined to be the number of vertices of P B assigned 
to the cluster Vi by the embedding.) In a slight abuse of notation let n(i) be the number of 
neutral pairs in Q B starting at Vi. Note that 



(7) 



am 



m B 



(6) 

< 



am 



s B t 



M B 



+ 



sbt + 3t 



M B 



< 



a(i) 



s B t 



M B 



+ e TUB- 



The requirement that the neutral pairs in Q are at a distance of at least three from 
each other means that \Q\ > n(C)/4. By the observation in Section [6.31 we know that Pb 
contains at least 2|Q|/5 > An/10 neutral pairs. The paths Qi and P g together contain fewer 
than s B r + 3t neutral pairs and at most 4s B neutral pairs can be in the P{ but within a 
distance of at most three from a Qi. Thus for all i 



n(i) > 



Xn 
10M B 



e*s B t- (s B r + 3t + 4sb) > 



Xn B 
6M B 



2e*n B > 



Xm B 
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Figure 3. Incorporating an exceptional vertex when C is far from C*. 

For all 2 < i < sb we can join Pj_i and Pi by a path in Rb isomorphic to Qi using LemmafTol 
Furthermore we can greedily extend P\ backwards by a path isomorphic to P B Q\. This will 
increase a(i) by at most ssr+St < £*m,B for n sufficiently large. We now have an assignment 
of Pb to the clusters of Rb which we can think of as a walk Wb in Rb- 



7.2. Incorporating the Exceptional Vertices. Let G° B be the digraph obtained from 
the pure oriented graph G* B by making all the non-empty bipartite subgraphs between the 
clusters complete (and orienting all the edges between these clusters in the direction induced 
by Rb) and adding the vertices in Vq as well as all the edges of G between Vq and V(Gb — Vq). 
Our next aim is to incorporate the exceptional vertices Vq into the walk Wb- We do this 
by considering the following extension of Rb- Define R* B 5 Rb to be the digraph formed 



Vq 1 and Vi 



E V(Rb), the edge vV { if 



by adding to Rb the vertices in V ' and, for v S 
\N£(v) nVi\ > am B /W and Vv if \N G (v) nV\ > am B /W- 

Then for each v G Vq pick an inneighbour Vi G V(Rb) and change the assignment of one 
neutral pair currently mapped to VVi+iVi to VivVi. We can always find such an inneighbour 
as ([3]) implies that each exceptional vertex sees at least a three-eighths proportion of the 
vertices in V(Gb)- This reduces a(i + 1) and n(i) by one. Figure [3] contains an illustration 
of this, where we consider Wb as being in G C B and the dotted lines as the section of the 
embedding to be replaced by the solid lines. After doing this for every exceptional vertex 
we will have that for all Vi E V(Rb) 



ait 



(8) 





mB\ < 



ait 



s B t 



M B 



+ e m B 



(5) 



< (e*sst + EArriB + |Vq|) + e*m B < ^eatis 



where the second term in the second line comes from greedily embedding the Qi. We also still 
have a reasonable number of neutral pairs starting at each cluster of Rb for all Vi £ V(Rb)- 



n(i) > 



\m,B 



l^o'l > 



Am 



B 



EATlB > 



Am; 
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Note that of the a(i) vertices of Pg assigned to any Vi G V(R), at most e^nB+2|l^|| < Ze^nB 
do not have their neighbours assigned to V{-i U Vj+i, where the first term came from the Qi 
and the second came from incorporating the exceptional vertices. Thus we currently have a 
(4e^4, 3ea ^corresponding embedding of Pb into R* B . 

7.3. Adjusting the Embedding. We now adjust Wb to obtain a 5£A-^B-corresponding 
assignment of Pb to R* B ; i.e. we adjust Wb to ensure that a(i) = tub for all Vi G V(Rb)- 
Recall from Corollary [14] that between any two vertices in Rb there exists a skewed traverse 
of length at most r' := [4/a]. Then for each cluster Vi G V{Rb) with a(i + 1) > ms 
pick Vj G V(-Rb) with a(j) < ms and find a skewed V^-Vj traverse of length q < r': 

ViVku Vfci-iVfca, Vfc 2 _iVfc 3 , . . . , Vk q Vk q -i, Vk q -\Vj. 

As discussed in Section H] we can use this skewed traverse to modify Wb to reduce a(i + 1) 
by one, increase a{j) by one and leave the number of vertices assigned to every other cluster 
of Rb the same. We do this by, for every < p < q, replacing a neutral pair V k \V k V kp -\ 
in W B by V kp ^ x V kp+1 V^-i where we define V ko -i := V and V kq+1 :=Vj. 

Since X]i=i \ a (f) ~ txi b\ < AsaMbtib, doing this will consume at most AsaMbub neutral 
pairs starting at any vertex of Rb- This is fine though as for all Vi G V(Rb) we have n(i) > 
Amfi/8 3> AeaMbub- Each cluster Vi now has at most 3£a^b + Aea Mbtib < ^eaMbtlb 
vertices of Pb assigned to it that do not have both their neighbours assigned to Vi-\ U Vi+i. 
Hence we have constructed a 5eA-^B-corresponding embedding Wb of Pb into R* B . 

7.4. Finding a copy of Pb in Gb- We will now use Lemma [10] to find a copy of Pb in Gb- 
To do this we use Wb to find an embedding W' B of Pb into Gb such that 

• Every vertex of Wb in Vq is unchanged in W' B . 

• Each appearance of a cluster of .Rb in Wb is replaced by a unique vertex in the 
corresponding cluster in Gb- 

• Every edge of Wb which does not lie upon an edge of Fb is mapped to an edge 
of G B - 

First we split Wb into two digraphs W B and W B . Let W B consist of all maximal walks 

Ui,lUi,2 ■ ■ ■ Ui,£i 

in Wb of length at least three whose edges all lie on Fb- Let W B consist of everything 
not in W B . Then W% is a union of walks vnVi2 ■ ■ ■ Vi ki , where we relabel if necessary to 
ensure that Ui ; i = Vi-i ;ki _ 1 and = v^i. In the next paragraph we will greedily find an 
embedding of W B into Gb which will satisfy the third requirement above. 

The walks in W B are of one of three types. The first type comes from the incorporation 
of an exceptional vertex, in which we have an exceptional vertex x G Vq and a cluster Vi G 
V(Rb) with \Nq(x) fl Vi\ > ams/lO. In this case we choose any two distinct vertices u, v G 
Nq{x) fl Vi, which we can do as there are at most |Vq| <C ems <C arris /W exceptional 
vertices. The second type comes from the paths Qi and the path P B . These we find in G* B 
(and hence in Gb 2 G* B ) greedily. We can do so as the total length of the Qi is at most 
sbt + 2t <C ems and all their edges are assigned to edges in Rb corresponding to e-regular 
pairs of density at least d in G* B . The final type are pairs of edges ij,ji with i,j G V{Rb) 
which come from the skewed traverses used to ensure that the correct number of vertices 
of Pb were assigned to each vertex of Rb- There are at most Se^M^n <C ems of these and 
so we can again find these greedily. Note that our requirement that all the neutral pairs in Q 
are at a distance of at least three from each other and the ends of the Pi implies that we have 
now considered all possible walks in W B . To satisfy the second condition above we simply 
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assign each vertex of Wb not already assigned to a (distinct) vertex in the corresponding 
cluster in Gb- As Wb is balanced (i.e. Wb assigns exactly tub vertices to each cluster) we 
can do this. 

For all i let Si consist of the vertices of Gb — Vq to which the vertices of W B that are not 
at the end of a path have been assigned. We can now apply Lemma \W\ to Gb — Vq with W B 
as H, the m t \ and as the xp and yp respectively and the S% as just defined. Combining 
this with the embedding of W B into G gives us a copy of Pb in Gb- 

7.5. Finding a copy of C in G. Recalling how we 'chopped up' C at the start of this 
section, let u, v E V(Gb) be the vertices to which the endpoints of Pb were assigned. To 
complete the proof of this case we find a copy of Pa in Ga '■= G[A' U {u, v}] starting at v and 
ending at u. We find a copy of Pa exactly as we found the copy of Pb with three differences. 
Firstly there are no exceptional vertices. Secondly, recalling that 

P A :=Q' 1 Pi...Q' SA P> A Q*PX, 

we require that the embeddings of Q'i and PjJ start and end at v and u respectively. Since 
is long enough for Lemma [151 we can specify the cluster to which its initial vertex is assigned 
and use Lemma [15] to join it to P[. We embed P\ greedily and use Q* and Lemma [TBI 
to connect it with the rest of the embedding. Hence we can indeed start and end at the 
required vertices. This doesn't affect the constants in the rest of the proof. Since the number 
of exceptional vertices and the imbalances created by the approximate embedding are both 
small (and small as functions of Ma) we can proceed exactly as before and find the desired 
cycle C in G. The calculations work as before as a result of us only having two exceptional 
vertices. The equation ([8]) becomes 

s A t 



\a(i) — tjiaI < 



an, 
1 ' M A 



e niA 



< (e*SAt + SAfnA + \{u, v}\) + e*rriA < ^£A m A- 
Hence from Section [7.41 we now have 

M A 

\a(i) -m A \< ^e A M A mA, 

i=l 

which is fine as we will have that n(i) > Am^/8 <C ^-SaMa^a for all clusters V( € V(Ra)- 

8. Cycle is Close to C* 

Our argument closely follows that in the previous section, the difference being in the 
means of correcting imbalances. To correct imbalances we will need long sections of Pb with 
no changes in orientation. Define £b '■= \^~\Mb, which is at least the maximum length of a 
shifted walk between two vertices in Rb- As before we split up C into Pa and Pb, the only 
difference being that we do not need a special vertex v* this time. Let Q' B consist be the 
largest possible collection of paths in Pb of length 3£b all at a distance of at least 3 from 
each other, oriented in the same direction and containing no changes in orientation. We will 
call these long runs. There are at least 

m(P B , Q B ) > ITf, — T~r ~ 2/ ^ n - 



U B + 6 ~ UM B 

of these in Pb- (We subtract 2An not An as both neutral pairs ViV^iVi and their in- 
verse ViVi-iVi kill possible long runs.) 

Let Qb be the subset of Q' B containing those long runs contained in the Pi, at a distance 
of at least 4 from the ends of all the P, and all oriented in the same direction. We assume 
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FIGURE 4. Incorporating an exceptional vertex when C is close to C* . 

that these are all oriented in the same direction as Pr. Keeping only long runs oriented in 
one direction loses us at most half of them. The paths Qi, the path Q*P B and the 3 vertices 
neighbouring them in the Pj in each direction can intersect at most 2s + 2 of the long runs 
and so, abusing notation slightly, 

it, \ ^ arLB o o ^ anB 
m{PB) > 2s — 2 > 



28M B ~ 30M B 

for sufficiently large n, where we recall that Vb '■= {Pi, P2 ■ • • > Ps B }- Similarly defining I a := 
[-]Ma and Q' A and Qa in the obvious way we have m{VA) > a(n^)/30M J 4. 

Apply Lemma [TBI to Rb, Qb and Vb with e* as 7 to obtain an embedding of the Pj 
into V(Rb) with 



(9) 



,.x, s B t 



M B 



, * . / .x . olu b * an B 

< £ SHE, mil) > k £ Sri > tt 

W - 30M| ~ 32M| 



for all € V(Rb), where we write m(i) for the number of elements of Qb whose initial 
vertex is assigned to Vj E V(R). 

For all 2 < i < sb we can join P%—\ and Pj by a path in Pr isomorphic to Qi using 
Lemma[l5j Furthermore we can greedily extend P\ backwards by a path isomorphic to P B Q\. 
This will increase a(i) by at most s#r + 2t < ea^b for n sufficiently large. We now have 
an embedding of Pr into Rb which we can think of as a walk Wb in Rb- 

Let G* B , G C B and R* B be defined exactly as in Section 1731 Let v G Vq be an exceptional 
vertex and let ViV,vVj € E(R* B ). (Vi and Vj exist by ([3]).) Take a long run in Qb whose 
initial vertex is currently assigned to Vi . Since Mb divides l B it also ends at Vi. We cannot 
replace the long run simply by VivVjFs ■ ■ ■ Fb because this would not end at Vi. Thus it 
would require us to alter the rest of our approximate embedding, possibly causing ([9]) to 
no longer hold. Instead we use shifted walks and a 'jump' to ensure that our modification 
incorporates v into our walk and does not alter a(i) or m(i) significantly for any cluster 
of Rb- We replace the long run starting at Vi with the following walk 

VvVjSiVj^+^FsFB . . . F B V h 

where S{Vj, is a shifted walk from Vj to The number of Fb is chosen so that 

the new section has exactly the same length as the long run it replaces. This is illustrated 
in Figure HI This is a walk that goes out to v, back to Vj, follows a shifted walk to ^4.3 
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and then winds around F until we have a walk of length 3£ B ending at Vi . This new walk 
visits Vi+i and one time fewer than previously and Vj one time more. Observe that the 
shifted walk by definition visits every cluster in R B the same number of times, which allows 
us to observe that we still end at Vi. Repeating this for each exceptional vertex creates a 
new assignment now satisfying 



am 



m B \ < 



am 



s B t 



+ 



s B r + 2t 



M 



13 



< (eAS B t + £pjn B + \ Vq\) + £Am B < 3eAn B . 
for all i. We also still have a reasonable number of long runs starting at each cluster. 



m(i) > 



an B 
32M 2 B 



tt/'l anB 



Note that of the a{i) vertices of P B assigned to Vi E V(R), at most 

£Am B + 4| Vq | < 5eaii b 

do not have their neighbours assigned to V^_i U Vi+%. The first term here comes from 
connecting the P{ and the second term from incorporating the exceptional vertices: each 
exceptional vertex has one direct edge to or from a given cluster in R B and the shifted 
walk can add at most two edges outside F B to each cluster. Thus we currently have a 
(5ea, 3eA)-corresponding assignment of P B into R* B . 



8.1. Correcting the imbalances. We now adjust our current assignment of P B to R* B to 
obtain a 156^-corresponding assignment, i.e. we adjust W B to ensure that a(i) = m B for 
all Vi € V(R B ). To do this we find a pair V{, Vj € V(R B ) such that a(i) > m B and a(j) < m B 
and replace a long run starting at Vi-i with the following walk: 

S(V5_i, VjWVjM+jFB . . . F B Vi-i, 

where the number of F B is chosen to ensure that the new section has length 31 B . This 
walk removes the assignment of one vertex to Vi, assigns one extra vertex to Vj and does not 
change the number of vertices assigned to all other clusters in R B . Since Xa=a a W = m BM B 
we can always find such a pair unless we have corrected all the imbalances. Each pair requires 
a long run and we still have at least an B / A0M B S> 3eAn B of these starting at each cluster 
and so can indeed correct all the imbalances. This leaves us with a balanced assignment 
with at most 

3sAn B + 4 • 3eAn B = lSe^ng 

edges outside F B from each vertex. Hence there are at most 15sAM B n B <S em B edges in 
total not in a path of length at least 3 all of whose edges lie on F B or not lying entirely 
on F B . This is exactly the same position as in Section [T~4l We can now proceed as before 
to first find a copy of P B in G B and then repeat the procedure with Pa (using Qa not Q B ) 
to find the desired cycle C in G. This completes this section and the proof of Theorem [3l 
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